Physica A: Statistical Mechanics and its Applications
○ Elsevier BV
Preprints posted in the last 30 days, ranked by how well they match Physica A: Statistical Mechanics and its Applications's content profile, based on 10 papers previously published here. The average preprint has a 0.01% match score for this journal, so anything above that is already an above-average fit.
Hwang, C.-K.; Chen, Y.-W.; WANG, Y.-T.; Ho, T.-S.; Oyang, Y.-J.
Show abstract
BackgroundDengue has been a major health threat globally in recent years. In particular, dengue incidences continue to increase annually and the epidemic area has expanded primarily due to global warming. Therefore, effective case detection and surveillance strategies are crucial to tackle this global health challenge. In clinical practice, the rapid test kit detecting dengue non-structural protein 1 antigen and commonly referred as NS1, is widely employed for early diagnosis. However, real-world studies revealed that the sensitivity of the NS1 test kit ranged from approximately 61% to 95%. Since early diagnosis is really critical for disease surveillance in the early stage of a dengue epidemic, scientists have been working hard to develop novel diagnosis methods that can provide higher sensitivity levels. Methodology/Principal FindingsIn response to this challenge, in this study, we have developed a novel diagnosis procedure that integrates machine learning technologies with the NS1 test kit. Our experimental results revealed that we would be able to raise the sensitivity of the dengue diagnosis procedure to higher than 99% by incorporating machine learning based prediction models to screen the suspected patients with a negative NS1 result. Furthermore, the relative risks between the suspected patients who were predicted to be positive and those who were predicted to be negative exceeded 4.8. Conclusions/SignificanceThese results illustrate that the proposed approach provides an effective and efficient diagnosis procedure to address the global health challenge caused by spread of dengue. Author SummaryThis study has aimed to enhance surveillance of the dengue disease by integrating machine learning technologies with the rapid test kit commonly employed in early diagnosis. In clinical practice, the NS1 rapid test kit is widely employed for early diagnosis. However, real-world studies revealed that a certain percentage of the patients with a negative NS1 test result, ranging from 5% to 39%, were actually infected by dengue. Since early diagnosis is critical for disease control in the early stage of a dengue epidemic, scientists have been working hard to tackle this challenge. Based on this observation, this study was launched to investigate the effects of incorporating machine learning based prediction models to further screen those patients with a negative NS1 test result. The experimental results revealed that the proposed approach was able to identify over 99% of the patients who were infected by the dengue disease. Furthermore, the risk of the suspected patients who were predicted to be positive was 4.8 times higher than the risk of those who were predicted to be negative. The experimental results illustrate that the proposed approach provides an effective and efficient diagnosis procedure to enhance surveillance of the dengue disease.
Jaeger, K. H.; Tveito, A.
Show abstract
The Poisson-Nernst-Planck (PNP) system is an accurate model of electrodiffusion of ionic species. It is commonly used in situations where nanoscale resolution is required, for instance close to ion channels in the membranes of biological cells. The inherent stiffness of the equations has made them challenging to solve and has limited the applicability of the system. In particular, the time step required for stable solutions has typically needed to be very short (nanoseconds), which makes simulations on the time scale of an action potential (milliseconds) difficult. Recently, it has been observed that avoiding operator splitting and instead solving the concentration equations and the electrostatic equation in a coupled manner relaxes the time-step limitation considerably. However, no theoretical explanation of this observation has been provided. Here, we aim to explain why the coupled scheme allows much larger time steps. We illustrate the mechanism by considering special cases that define necessary, but not sufficient, conditions for stability. We also show that these conditions remain relevant for the fully coupled PNP model in 3D.
Cresson, J.; Pere, M.; Szafranska, A.
Show abstract
This work focuses on the global and partial identification problem for fractional differential equations. We provide a general numerical procedure based on global and local optimization algorithms with two refinements for biological systems that ensure solution positivity and homogeneous parameter units. The method is applied to a new fractional model of Dengue outbreak called the Fractional Homogeneous Nishiura (FHN) model, calibrated using data of newly infected people in Cape Verde. We show that our identification method yields a better fit between data and model solutions than previous approaches and that our FHN model captures the dynamics of Dengue more closely than existing systems.
Sajjad, M.
Show abstract
Artificial intelligence (AI) tools have been rapidly adopted by medical researchers, yet whether early career researchers in low and middle income countries possess the awareness and habits needed to use these tools safely remains poorly documented. This study characterized AI adoption patterns, hallucination awareness, and verification and disclosure practices among early career medical researchers in Pakistan. A cross sectional anonymous online survey was conducted among medical students, house officers, residents, physicians, and faculty involved in research or academic work across Pakistan (May 2026). Descriptive statistics and chi square tests were applied to 373 eligible responses. AI use was near universal (99.7%), with 60.3% using AI tools daily. The most commonly reported tool in this sample was Claude (40.5%), followed by ChatGPT (29.2%) and Perplexity (26.0%), though this ranking likely reflects sampling characteristics. Despite high adoption, 59.2% typically did not verify AI outputs before use, and 40.2% had never heard that AI can generate fabricated scientific references. In behavioral vignettes, 36.5% assumed convincing AI generated references were authentic, and 54.2% would continue using remaining AI content after discovering one fabricated reference. Formal research training was strongly associated with consistent disclosure (51.7% vs. 17.1%; chi square=48.43, p less than 0.001). Role, daily use frequency, and research training were not significantly associated with verification behavior. Early career medical researchers in Pakistan demonstrate high AI adoption alongside incomplete hallucination awareness and infrequent verification, a pattern that may carry implications for research integrity. Formal training was the only factor significantly associated with consistent disclosure. Integration of AI literacy into medical curricula and institutional governance frameworks merits consideration.
Shah, A.; Mehta, A.; Bhensdadia, C. K.
Show abstract
Mental health challenges among university students have increased due to academic pressure, lifestyle changes, and continuous digital engagement. Existing approaches for mental health assessment often rely either on self-reported psychological scales or isolated behavioral indicators, limiting their ability to capture complex temporal and contextual patterns. This study proposes an interpretable multimodal framework for student mental health risk assessment using behavioral sensing, academic information, ecological momentary assessments (EMA), and psychometric survey data. A bidirectional Long Short-Term Memory autoencoder is employed to learn latent temporal representations from day-level behavioral sequences, while graph embeddings capture structural relationships among students using similarity-based neighborhood graphs. These representations are fused with academic and survey-derived features and reduced using Principal Component Analysis and Uniform Manifold Approximation and Projection. K-means clustering is then applied to identify behaviorally distinct student groups. Experimental analysis on the StudentLife dataset demonstrates meaningful clustering performance with a Silhouette Score of 0.4209 and Adjusted Rand Index stability of 0.6869. The identified clusters correspond to low-risk, moderate-risk, and high-risk behavioral profiles. To improve interpretability and practical usability, a fuzzy inference system is introduced to compute mental risk, academic risk, and wellbeing indices using psychometric indicators including PHQ-9, PSS, PANAS, VR-12, and Big Five personality traits. The results demonstrate the potential of combining multimodal behavioral modeling with interpretable fuzzy reasoning to support early mental health risk assessment in educational settings.
Bhakat, S.
Show abstract
Wild-type T4 lysozyme (T4L) is used as a benchmark to evaluate conformational sampling across generative AI, AI-accelerated molecular simulation (AMS), and physics-based enhanced molecular dynamics (EMD). A four-state model: exposed/open, exposed/closed, buried/open, and buried/closed; is defined using physically meaningful collective variables. While generative AI methods (AF-cluster, MSA subsampling of AlphaFold2, ConforFold, AlphaFlow, ESMFlow, ConfRover, BioEmu) largely sample only the exposed/open state, AMS integrating generative ensembles with iterative molecular dynamics, recovering all states and reproducing equilibrium populations similar to EMD and experimental smFRET signatures.
Bartling, B. A.
Show abstract
University Students are particularly vulnerable to disordered eating behaviors (DEB) and attitudes (DEA). This study expands upon the knowledge base of DEA and DEB in university students by employing a netnography as a precursor to the main study to establish the following research questions: What is the relationship between the perceived quality of dining services and DEA? What is the relationship between the perceived availability of dining services and DEA? And lastly, how does prior experience with dining services affect eating patterns and attitudes toward food? The first study utilized a netnographic approach in order to evaluate issues with university dining services, leading to the design of the second study. Students at an upper Midwestern university (n=88) were surveyed via convenience sampling. Eating attitudes, eating behaviors, and relationships with dining services were measured. A statistically significant relationship between the availability of services and the DEA was found. A statistically significant relationship between the availability of services and risk behaviors was found. However, no statistically significant correlation existed between first-year dependence on on-campus dining services and risk behavior related to eating disorders or eating attitudes. Based on this, we know the quality of nutrition and the availability of services impacted students eating attitudes and behaviors, not inherent dependence.
Alsaiari, A.; Turki, T.; Taguchi, Y.-h.
Show abstract
Ovarian cancer is one of the gynecological cancer types, which, if metastasized and not detected early, can cause deaths among women. Therefore, there is a need to accurately predict drug responses to ovarian cancer. A gynecological pathologist inspects abnormality in tissues, followed by providing a report about patients; however, such a diagnostic process is (1) hard; (2) requires experience; and (3) time consuming. Moreover, existing tools are far from perfect. Hence, we present a computational pipeline to improve predicting drug response pertaining to ovarian cancer, derived as follows. First, we download digital pathology images pertaining to ovarian bevacizumab response from the cancer imaging archive repository. We employed histogram of oriented gradients to images, constructing feature vectors, provided to Fisher linear discriminant analysis to change the representation through dimensionality reduction. Then, we provide reduced-dimensionality data for regression analysis through support vector regression coupled with various kernels and calculating the area under the ROC curve (AUC). Experimental results against transformer-based models (ViT and Swin) and other deep learning (DL) models (VGG16, ResNet50, InceptionV3, MobileNetV2, and EfficientNetB6) demonstrate that our approach with radial kernel (named SVRD+R) yielded an AUC performance improvements of 17% against the best-performing transformer-based model (ViT) while obtaining an AUC performance improvements of 14.9% when compared against the best DL-based model (MobileNetV2). These results demonstrate the superiority and feasibility of our AI-based pipeline when tackling prediction problems pertaining to gynecologic cancer studies. MSC92B05; 68T09
Sozol, S. S.; Dev Nath, B. C.; Fahim, F. M. S.; Suzana, N. N.; Mirza, J. F.; Ahmmed, S.; Zohra, F.-T.; Zafr, A. H. A.; Uddin, M. N.; Mondal, M. R. H.; Hoque, A. S. M. L.
Show abstract
Machine learning (ML) is being considered to help diagnose cardiovascular diseases (CVD). Still, challenges like inconsistent and limited datasets, limited infrastructure, and global inequalities lead to the need for a reliable and practicable ML solution. This paper presents an ML-driven framework for predicting CVD risk scores and classifying status. Several data preprocessing techniques, including multiple imputation by chained equations (MICE), outlier removal, are considered. In addition, hyperparameter tuning is performed with the GridSearchCV tuning technique. Moreover, a consensus-driven five-feature selection method is applied to identify optimal predictors. The dataset used in this study contains healthcare records related to future CVD risk scores, comprising 1,529 patient records with 22 features. The optimized stacked ensemble model is applied to the dataset and achieves a cross-validated coefficient of determination value of 98.13% for CVD risk score regression. Comparative evaluation with other ML models confirmed improved accuracy, efficiency, and interpretability. The explainable AI technique SHAP is applied to interpret predictions and highlight key risk factors. Moreover, a deployment-ready web platform with multi-role access has been developed that demonstrates clinical applicability. The proposed framework offers a reliable and interpretable tool for early detection of CVD and personalized risk assessment. In the future, this work can be extended to integrate longitudinal data, medical imaging, and deep learning to improve generalizability and strengthen real-world impact.
Sasai, M.; Fujishiro, S.
Show abstract
When three cyanobacterial proteins--KaiA, KaiB, and KaiC--are incubated with ATP in vitro, the phosphorylation level of KaiC exhibits stable circadian oscillations. Biochemical and structural analyses have shown that KaiCs ATPase activity is crucial for these oscillations, leading to the hypothesis that ATP-consuming dynamics function as a molecular clock, determining the oscillation period of individual molecules. Moreover, these molecular clocks synchronize with one another, resulting in collective oscillations at the ensemble level. In this study, we develop a theoretical model to test this molecular clockwork hypothesis. Our model clarifies the relationship between the oscillation period and ATPase activity, explaining the significant changes in the period induced by amino-acid substitutions near the CI-CII domain boundary of the KaiC hexamer. Furthermore, the model addresses the physical basis for temperature compensation concerning both the oscillation period and ATPase activity. Thus, the molecular clockwork perspective provides a framework for understanding the atomic design behind collective oscillations.
Rao M, S.; Khezrimotlagh, D.
Show abstract
Non-invasive wrist pulse monitoring has been integrated into various medical systems for cardiovascular assessment. However, different definitions of pulse transit time are used in the literature, and their statistical behavior when measured locally at the wrist using pressure sensors has not been systematically examined. Wearable wristbands designed to measure pulse transit time (PTT) have emerged as valuable tools for evaluating cardiac activity. While several algorithms have been developed to predict blood pressure using PTT, it is well recognized that PTT and its inverse parameter, pulse wave velocity (PWV), exhibit temporal variability. In this study, PTT was explicitly measured at the wrist's radial artery to investigate its statistical variation and relationship with different arterial pressures. The experiment exhibits two distinct methodologies for PTT computation using onset-based and peak based measurements. Data were recorded across five cuff pressure levels at 20, 40, 60, 80, and 100 mmHg using the pulse pressure sensor (PPS). PTTonset time shows lower coefficient of variation as compared to PTTpeak time within the 100 mmHg pressure range. The weak correlation coefficient is recorded between PTT values. However, dynamic time warping (DTW) analysis revealed a notable similarity in the time series of PTTonset and PTTpeak, regardless of the applied pressure level. For the multi participant dataset, the mean DTW distances ranged from 0.029 to 0.046 across the tested cuff pressures, illustrating consistent similarity between PTTonset and PTTpeak over time. The objective of this study is to examine the statistical behavior, stability, and temporal similarity of the two commonly used PTT definitions when measured at the radial artery using pressure sensors. Statistical analysis shows consistent differences between the two PTT definitions across participants. PTTonset shows lower variation than PTTpeak. However, PTTpeak requires simpler computation and produces fewer detection errors, while PTTonset provides lower statistical variation.
Reingruber, J.; Paquin-Lefebvre, F.
Show abstract
A major challenge in neuroscience is to predict how currents in nanodomains affect voltage and ionic concentrations. Cable and Rall theory provide analytic current-voltage relations by neglecting concentration gradients, and the impact of concentration gradients is usually studied numerically with the Poisson-Nernst-Planck (PNP) model. A precise quantitative understanding of the combined dynamics remains limited because analytic current-voltage-concentration relations are missing. In this work we derive such relations using a novel approach based on cross-diffusion equations. For narrow cylindrical domains, we derive time-dependent and steady-state expressions that explicitly show how currents affect voltage and ionic concentrations. We find that the influx of only one ion can significantly change the concentrations of all the other ions even if no channels for these ions are present. After a current injection we compute a biphasic voltage transient where the small-time asymptotic corresponds to the steady-state solution of the cable equation. We show that the accuracy of cable theory prediction for the voltage depends on how the current is distributed among the various ions. Finally, we develop an iterative method to accurately compute steady-state profiles for voltage and concentrations using first-order results by subdividing a cylinder into small segments.
Gogo, J. A.; Wanyonyi, M.
Show abstract
Malaria remains a major public health challenge in sub-Saharan Africa, with pronounced spatial and temporal variation in transmission intensity that complicates effective control strategies. Accurate classification of transmission states is essential for guiding targeted interventions and strengthening early warning systems. This study develops a machine learning framework for the classification of malaria transmission states in Kenya using monthly panel data from 47 counties spanning the period 2015 to 2025. Transmission was categorised into four operationally relevant states based on incidence thresholds. Four supervised learning models, namely multinomial logistic regression, random forest, extreme gradient boosting, and support vector machine, were trained using temporally lagged features and evaluated under a forward chaining validation scheme to preserve temporal structure. Model performance was assessed using accuracy, macro averaged F1 score, Matthews correlation coefficient, and Brier score, complemented by calibration analysis. Extreme gradient boosting achieved the best overall performance, with accuracy of 0.9918, macro averaged F1 score of 0.9647, and Matthews correlation coefficient of 0.9831, alongside the lowest Brier score of 0.0031, indicating highly reliable probability estimates. Feature importance analysis revealed that lagged incidence, vegetation index, precipitation, and insecticide treated net coverage were the most influential predictors. Partial dependence analysis demonstrated nonlinear relationships and clear seasonal patterns in transmission dynamics. The findings show that machine learning approaches can accurately classify malaria transmission states while providing interpretable and well calibrated outputs for decision making. This framework offers a practical tool for supporting malaria surveillance and resource allocation. Further validation in different epidemiological settings is recommended to assess generalisability.
Terada, K.; Kondo, Y.
Show abstract
Mechanical properties of epithelial tissues play essential roles in morphogenesis and physiological function. In this study, we analytically derived the in-plane bulk modulus, shear modulus, and Poissons ratio of a three-dimensional cell vertex model of epithelial monolayers. We showed that the model can robustly reproduce a near-zero in-plane Poissons ratio, a mechanical feature reported in cultured epithelial tissues. Numerical simulations further confirmed that the theoretically predicted Poissons ratio accurately describes the response of the model under finite, biologically relevant strains. In addition, the model exhibits not only morphological bistability between squamous-like and columnar-like states, but also mechanical bistability characterized by distinct elastic responses. Together, these results provide a minimal three-dimensional framework that links cell-scale mechanical interactions and epithelial morphology to tissue-scale elastic properties.
Bustamante, C. J.
Show abstract
Despite progress in predicting protein structures, how proteins arrive at their native state remains a subject of continuous debate. We present a single molecule force spectroscopy study of the unfolding and refolding intermediates of the conserved, diverse, and ancient Rossmann2x2 fold ({beta}12{beta}34{beta}56{beta}78). By inserting glycines at different locations in the protein, we can follow in real time and annotate its unfolding and refolding intermediates. This protein folds along a single reversible pathway involving the ordered and sequential organization of discrete and cooperative folding units or foldons: unfolded {rightleftarrows} {beta}12{beta}3 {rightleftarrows} {beta}12{beta}34{beta}5 {rightleftarrows} {beta}12{beta}34{beta}56{beta}7 {rightleftarrows} {beta}12{beta}34{beta}56{beta}78. This strict order results from the formation of an autonomously folding unit (primary foldon) and the subsequent organization of elements (secondary foldons) whose stability depends on their interactions with previously organized ones.
Lehmann, L.
Show abstract
Darwinian fitness is equated here with invasion fitness and defined as the quantity determining the fate--certain extinction or possible spread--of a single mutant type. We derive it, together with its phenotypic derivative, for evolution in group-structured populations under limited genetic mixing, where the demography of the focal species and its environment is modeled as a discrete-time stochastic process. Reproduction, physiological development, dispersal, and survival are influenced by interactions within and between groups and by environmental fluctuations within and across generations. Using multitype branching processes in random environments, we show that invasion fitness is predicted by a stochastic growth rate that can be represented biologically in two meaningful genealogical ways. First, as the long-term geometric mean of the expected per-capita number of mutant copies produced per time step by a representative member of the mutant lineage. Second, as the the long-term geometric mean of the expected reproductive-value-weighted per-capita number of mutant copies produced by such an individual. This latter representation is useful for computing the phenotypic directional derivative of invasion fitness. Moreover, this derivative can be written as an actor-centered inclusive-fitness effect derived from properties of the resident population process. This effect depends on class-specific fitness differentials, relatedness, reproductive values, and class frequencies. However, unless generation- and class-specific fitness defines a stochastic matrix, the derivative does not separate stochastic reproductive values from relatedness and class frequencies, and must be evaluated by simulations. In summary, we formalize invasion fitness biologically quite generally and show how Hamiltons marginal rule is deduced from it.
Crona, K.; Greene, D.
Show abstract
AO_SCPLOWBSTRACTC_SCPLOWGene interactions play an important role in the development of antimicrobial drug resistance and other evolutionary processes of medical importance. Empirical studies have revealed multiple peaks, inaccessible trajectories, and constraints on mutation order. Higher order epistasis is associated with obstacles in fitness landscapes. However, its significance has been debated in recent years, sometimes through reinterpretations of data from previous publications. We suggest that local higher order interactions may help reconcile these seemingly contradictory findings. Rank order based methods can be informative when other methods fail to detect consequential interactions. In addition to conventional rank order methods, including sign epistasis, we introduce signed bipyramids. A bipyramid interaction compares extreme genotypes against their intermediates, for example a triple mutant and the wild-type against the corresponding single mutants. In general, interactions are signed if they are implied by the rank order alone.
Prakash, D. L.; Banerjee, A.; Gosavi, S.
Show abstract
Coarse-grained structure-based models (CG-SBMs; or G[o] models) are simplified potential energy functions of biomolecules or biomolecular complexes that encode their structure. Molecular dynamics simulations of such SBMs have been successfully used to study long time-scale dynamics such as protein and RNA folding, and large conformational transitions of biomolecular complexes. SBMs have several advantages: (1) Their MD simulations are computationally inexpensive, making extensive sampling easily accessible to many researchers. (2) They are easy to modify and can be adapted for the specific biomolecular problem that needs to be investigated. However, the force-fields of SBMs are not usually included in commonly used biomolecular simulation packages resulting in a barrier to their use. Here, we present SuBMIT (Structure Based Models Input Toolkit; https://github.com/sglabncbs/submit), a toolkit for generating coarse-grained SBM input files for performing MD simulations with GROMACS and OpenMM/OpenSMOG. Simulations whose input files can be generated using the different flavors of CG-SBMs present in SuBMIT include the folding and conformational ensembles of proteins with intrinsically disordered regions, 3D-domain-swapping in proteins and the dynamics of RNA-protein assemblies (e.g., simple RNA viruses).
Skjegstad, L. E. J.; Oud, S.; Vroomans, R. M.; Kirkegaard, J. B.
Show abstract
Vertex models are widely used within the field of developmental biology to study tissue morphogenesis. These models are well-suited for modeling deformation at the cellular level where movement is driven by local forces. However, understanding how these microscopic movements coordinate to yield macroscopic phenomena such as the shapes of entire tissues remains a challenge. Here we study a top-down approach using differentiable programming on a simplified vertex model of a laminar tissue, and investigate whether the attributes of individual cells can be tuned to make the mesh as a whole acquire a predefined shape. We let the mesh evolve according to simple rules defined by the input to each polygon, and evaluate the resulting shape against a target boundary. Additionally, we show how the high degeneracy of the output can be reduced by constraining the polygon distributions: first, by adding simple penalties on tissue-wide attributes; and second, by dividing the tissue into regions, within which we bias the attributes toward characteristic values. Our study shows how a simple vertex model can be combined with differentiable programming to model developing tissues, and provides insight into the way individual cells must coordinate to yield macroscopic phenomena such as pre-programmed shapes.
Chuma, A. T.; Wang, C.; Voigt, J.-u.; Mekonnen, D.; Asmare, M. H.; Vanrumste, B.
Show abstract
Rheumatic heart disease (RHD) remains a major public health concern across low- and middle-income countries in the Global South. Early detection through community-based screening of asymptomatic individuals has been identified as a critical strategy for reducing the disease burden. Despite this, the absence of accessible, automated population screening tools continues to impede implementation at scale. This study investigates the screening potential of integrating electrocardiography (ECG) and phonocardiography (PCG) for the early detection of RHD in asymptomatic schoolchildren. The dataset was obtained as part of an ambulatory screening initiative conducted across multiple school sites in rural areas of Ethiopia. It comprised ECG and PCG recordings from 611 asymptomatic schoolchildren aged 10 to 20 years. A comprehensive set of time-frequency, visibility graph and non-linear features were extracted from both signal modalities. These features were subsequently evaluated using machine learning models to assess their utility in the automated screening of early RHD. The best model achieved an average 10-folds cross-validation scores on sensitivity, positive-predictive-value and F1-score of 59.6%, 63.6% and 60.8%, respectively for multimodal ECG and PCG signals. Whereas separate evaluation of ECG showed an F1-score of 61.1% and PCG achieved 23.5%. Key features included the T-wave, the area under the QRS complex, and entropy measures derived from beat visibility graphs in the ECG. In addition, visibility graph features from multi-band S1 and S2 heart sound segments, along with MFCC coefficients from the PCG, were also relevant. However, PCG alone performed poorly and did not show improved results over the ECG features. Although auscultation is key clinical diagnosis tool in symptomatic RHD, combined PCG with ECG features does not enhance asymptomatic RHD detection using the ECG modality alone.